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Abstract 

We consider the problem of bounded-error quantum state identification: given either state 
ao or state a±, we are required to output '0', '1' or '?' ("don't know"), such that conditioned 
on outputting '0' or '1', our guess is correct with high probability. The goal is to maximize the 
probability of not outputting '?'. We prove a direct product theorem: if we're given two such 
problems, with optimal probabilities a and b, respectively, and the states in the first problem 
are pure, then the optimal probability for the joint bounded-error state identification problem 
is 0{ab). Our proof is based on semidefinite programming duality and may be of wider interest. 

Using this result, we present two exponential separations in the simultaneous message pass- 
ing model of communication complexity. Both are shown in the strongest possible sense. First, 
we describe a relation that can be computed with O(logn) classical bits of communication in 
the presence of shared randomness, but needs f^n 1 / 3 ) communication if the parties don't share 
randomness, even if communication is quantum. This shows the optimality of Yao's recent 
exponential simulation of shared-randomness protocols by quantum protocols without shared 
randomness. Second, we describe a relation that can be computed with Oilogn) classical bits 
of communication in the presence of shared entanglement, but needs logn) 1 / 3 ) communi- 

cation if the parties share randomness but no entanglement, even if communication is quantum. 
This is the first example in communication complexity of a situation where entanglement buys 
you much more than quantum communication does. 
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1 Introduction 



1.1 Bounded-error quantum state identification 

Suppose we are given one of two mixed quantum states, ao or a\, each with probability 1/2. We 
know what ao an d oti are. Our goal is to identify which one we are given. It is well known that 
we can output the correct answer (0 or 1) with probability 1/2 + ||«o — ai|| tr /2, where || • \\ tr is the 
trace norm (the sum of the singular values, divided by 2). This is optimal. In particular, if ao and 
a\ are very close in trace norm, the best measurement will do little better than a fair coin flip. In 
some situations, however, we cannot afford to output the wrong answer with such high probability, 
and would rather settle for a measurement that sometimes claims ignorance, but that is usually 
correct in the case where it does give an output. 

To illustrate this, suppose the states involved are the following pure states: 

|ao) = y/E\0) + vT^a|2) 
jai) = y/a\l) + VT^\2) 

If we cannot afford to make a mistake at all, it is clear what measurement we should apply: measure 
in the computational basis, and if the outcome is the state must have been ao; if the outcome is 
1 the state must have been a.\\ if the outcome is 2 we claim ignorance. Note that the probability 
of getting an answer (0 or 1) for the identification problem is now only a. We have thus increased 
our confidence in the answer, at the expense of decreasing the probability of getting an answer at 
all. Now consider a slightly more "fudged" example, for some small e: 

|oo) = y/(l-e)a\ 0) + y/ea \l) + y /T=a \2) 
jai) = y/m\0) + y/{l - e)a\l) + v / T^|2) 

If we apply the same procedure as before, we have now a small probability of error: on both states 
our measurement outputs a guess (0 or 1) with probability a, and if we output a guess, then that 
guess is wrong with probability only e. If e is sufficiently small, this may still be acceptable for 
many applications. 

More generally, let A be some classical random variable, and B be another random variable 
whose range includes the special symbol '?'. We call B an (a, e) -predictor for A if Vi[B ^ ?] > a 
and Vi[A = B\ B^?]>1 — e. For example, the above measurement applied to state ax where 
X is a random bit, gives us an (a, e)-predictor for X if we interpret output 2 as '?'. Motivated by 
the above examples — and by our applications in later sections — we define the bounded-error state 
identification problem: 

Given a register containing ax, with X a uniformly random bit, and an e > 0, what 
is the maximal a for which there exists a quantum measurement on the register whose 
outcome is an (a, e)-predictor for XI 

We use D £ (ao,ai) to denote the maximal value a. We stress again that the error probability is 
a conditional probability, conditioned on actually outputting a guess for the bit (0 or 1). Unlike 
the straightforward distinguishing problem, where the optimal success probability is determined 
by the trace distance ||ao — aiL r , we do not know of any simple metric on density matrices that 
determines the value D £ (ao, a±). However, as was also noted by Eldar one can easily express 
quantities like this as the optimal value of a semidefinite program, as we will do in Section El 21 

Now suppose we are given another identification problem in a second register, quantum state 
Py for a random bit Y, and suppose b = D £ (/3q,/3i) is the largest value for which we can obtain a 
(6, e)-predictor for Y. We now want to determine the optimal probability with which we can identify 
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(again with error at most e or something related) both states simultaneously. That is, what is the 
maximal probability p = D £ (ao (g) Po,ao <8> /?i,ai (8> Po,ot\ ® Pi) such that a joint measurement on 
ax ® /?y gives us a (p, e)-predictor for XY? Since the two registers are completely independent, it 
seems there is nothing much better we can do except applying the optimal measurement for both 
registers separately. 1 Thus our intuition suggests that p < ab, or at least p < 0(ab). This problem 
has a flavor similar to "direct product theorems" in computational complexity theory, where one 
is usually interested in k > 2 independent instances of some computational problem, and the aim 
is to show that the overall success probability of some algorithm for the fc-fold problem is close to 
the product of the k individual success probabilities. Another problem with a similar flavor is the 
notoriously hard quantum information theory issue of multiplicativity of norms of superoperators 
under tensor product |16j . 

Proving our intuition actually turned out to be quite a hard problem, and we indicate some 
reasons why in Section fc.ll In an earlier preprint [121 we were only able to prove it for e = 0, which 
was then used by us in [T3| and to obtain various zero-error separations in communication 
complexity. The present paper supersedes all of these unpublished results and gives in Section |3] 
the first proof of the p < 0(ab) bound for the case where at least one of the two sides is pure (i.e., 
ao and a\ are both pure, or (3q and P\ are both pure). More precisely, we show 

D e/2 (a ® fa, a <g> fa, ai <g> Po, a x <g> ft) < 0(D e {a Q , a x ) ■ £> e (/3 , Pi))- (1) 

Notice that because of the e/2 on the left hand side, this bound is slightly weaker than what we 
have promised; as we indicate in Section r3.11 this modification is (somewhat surprisingly) necessary. 
Our proof relies heavily on a semidefinite programming formulation for the quantities involved and 
on an analysis of their duals. 

1.2 Exponential separations in communication complexity 

Apart from being an interesting information theoretic problem in its own right, the bounded-error 
state identification problem and our direct product theorem have interesting applications. We 
give two new exponential separations, both in the simultaneous message passing (SMP) model 
of communication complexity. The area of communication complexity deals with the amount of 
communication required for solving computational problems with distributed input. This area is 
interesting for its own sake, but also has many applications to lower bounds on circuit size, data 
structures, etc. The simultaneous message passing (SMP) model involves three parties: Alice, Bob, 
and a referee. Alice gets input x, Bob gets input y. They each send one message to the referee, 
to enable him to compute something depending on both x and y, such as a Boolean function or 
some relational property. The cost or complexity of a communication protocol is the length of the 
total communication for a worst-case input, and the complexity of a problem is the cost of the best 
protocol that solves our problem with small error probability. 

The SMP model is arguably the weakest setting of communication complexity that is still in- 
teresting. Even this simple setting is not well understood. In the case of deterministic protocols, 
the optimal communication is determined by the number of distinct rows (and columns) in the 
communication matrix, which is a simple property. However, as soon as we add randomization to 
the model things become much more complicated. For one, we can choose to either add shared 
(a.k.a. public) or private randomness. In other communication models this difference affects the 
optimal communication by at most an additive O(logn) |22j . but in the SMP model the difference 
can be huge. For example, the equality function for n-bit strings requires about y/n bits of com- 
munication if the parties have only private randomness [Tl 1231 12]. but only constant communication 

1 This actually gives slightly worse error 2e — e 2 for the prediction of XY, so potentially it could be that p <C ab. 



2 



with shared randomness! No simple characterization of SMP communication complexity with either 
private or shared randomness is known. 2 

The situation becomes more complicated still when we throw in quantum communication. 
Buhrman et al. 8 exhibited a quantum protocol for the equality function with O(logra) qubits 
of communication. This is exponentially better than classical private-randomness protocols, but 
slightly worse than shared-randomness protocols. Roughly speaking, their quantum fingerprinting 
technique may be viewed as replacing the shared randomness by a quantum superposition. 

1.2.1 Shared randomness beats quantum communication 

The fingerprinting idea of |Hj was generalized by Yao |2H], who showed that every classical shared- 
randomness protocol with c-bit messages for a Boolean function can be simulated by a quantum 
fingerprinting protocol that uses 0(2 4c logn) qubits of communication. This has since been im- 
proved to 0(2 2c logn) qubits |131 114] . In particular, every 0(l)-bit shared-randomness protocol 
can be simulated by an 0(logn)-qubit quantum protocol. Again, quantum superposition replaces 
shared randomness in this construction. 

This raises the question whether something similar always holds in the SMP model: can every 
classical shared-randomness protocol be efficiently simulated by some protocol that sends qubits 
but shares neither randomness nor entanglement? Since the appearance of Yao's paper, quite a 
number of people have tried to address this. Our first result, presented in Section^ gives a neg- 
ative answer to this question. Suppose Alice receives inputs x, s € {0, 1}™ with the property that 
s has Hamming weight n/2 and Bob receives input y £ {0,1}™. The referee should output, with 
probability at least 1 — e, a triple (i,Xi,yi) for an i satisfying s« = 1. We prove that protocols 
where Alice and Bob share randomness can solve this task with O(logra) classical bits of commu- 
nication, while every bounded-error quantum protocol without shared randomness needs f2(ra 1//3 ) 
qubits of communication. The quantum lower bound relies crucially on our direct product theorem 
for bounded-error state identification. This shows for the first time that the resource of shared 
randomness cannot be efficiently traded for quantum communication. 

Yao's exponential simulation can be made to work for relations as well, and our quantum lower 
bound shows that it is essentially optimal, since the required quantum communication is expo- 
nentially larger than the classical shared-randomness complexity for our relational problem. We 
expect a similar gap to hold for (promise) Boolean functions as well. Our separation complements 
a separation in the other direction: Bar-Yossef et al. ;3j exhibited a relation where quantum SMP 
protocols are exponentially more efficient than classical SMP protocols even with shared random- 
ness (also in their case it is open whether there is a similar gap for a Boolean function). Accordingly, 
the quantum SMP model is incomparable with the classical shared-randomness SMP model. 

1.2.2 Shared entanglement beats quantum communication with shared randomness 

The second application of our state identification result is again in the SMP model. While the 
previous application separated classical protocols with shared randomness from quantum protocols 
without shared randomness, this one separates classical protocols with entanglement (EPR-pairs, 
2-qubit states of the form ^(|00) + |11))) from quantum protocols with shared randomness. 

The additional power that prior entanglement gives is one of the most fundamental questions in 
quantum communication complexity. This additional power is not well understood. We basically 
know two ways in which entanglement can help: it can be used for teleportation (where one 

2 Kremer et al. |18| claimed a characterization of shared-randomness complexity as the largest of the two one-way 
complexities, but Bar-Yossef et al. flf Section 4] exhibited a function where their characterization fails. 
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EPR-pair and two classical bits of communication replace one qubit of communication) and it 
can be used for shared randomness (if Alice and Bob each measure their side of their shared 
EPR-pair in the computational basis, they get the same random bit). Neither saves very much 
communication, and it has in fact been conjectured for the standard two-party one-round and 
many-round protocols that the model of classical communication with entanglement [S| and the 
model of quantum communication without entanglement |27j are essentially equivalent. 

Our second separation shows that the situation is very different in the simultaneous message 
passing model. We show that the qubit-communication model cannot efficiently simulate the entan- 
glement model. In Sectional we exhibit a relational problem, inspired by the problem of Bar-Yossef 
et al. mentioned above, that can be solved with logn EPR-pairs shared between Alice and Bob 
and O(logn) classical bits of communication. In contrast, if only shared randomness is available 
instead of entanglement, every bounded-error SMP protocol needs 0((n/ log n) 1 / 3 ) quantum bits 
of communication. Again, our direct product theorem is crucial for proving the quantum lower 
bound. This is the first example of a communication problem where entanglement is much more 
useful than quantum communication. 

2 Preliminaries 

2.1 Quantum computing 

The essentials needed for this paper are quantum states and their measurement. First, an m-qubit 
pure state is a superposition \<j)) = ^ze{o i} m a A z ) over an classical m-bit states. The a z s are 
complex numbers called amplitudes, and ^ z \a z \ 2 = 1. Hence a pure state \4>) is a unit vector 
in C 2 "\ Its complex conjugate (a row vector with entries conjugated) is denoted (4>\. The inner 
product between \(f>) and \ip) = ^2, z fi z \z) * s * ne dot product (<f>\ ■ \ip) = (4>\tp) = ^2 z ct* z (3 z . The norm 
of a vector v is \\v\\ = y {v\v). Second, a mixed state p = XrfPil&X&l corresponds to a probability 
distribution over pure states, where |<fo) is given with probability pi. A /c-outcome positive operator- 
valued measurement (POVM) is given by k positive semidefinite operators Ei, . . . , E^ with the 
property that X^;=i E% = I- When this POVM is applied to a mixed state p, the probability of the 
i-th outcome is given by the trace Tr[Eip\. We refer to Nielsen and Chuang [21] for more details. 

2.2 Communication complexity 

We now give a somewhat informal description of the simultaneous message passing model discussed 
in our two applications. For a more formal description, we refer to Kushilevitz and Nisan ^Uj for 
classical communication complexity and to the surveys |17l 12S] f° r the quantum variant. In the 
simultaneous message passing model, Alice receives input x, Bob receives input y, they each send 
a message to a referee who should then output either f(x,y) in the case of a functional problem, 
or an element from some set R(x, y) in the case of a relational problem. We use 
i?|' en *(P) to denote, respectively, the optimal communication complexity of classical protocols that 
solve problem P with worst-case error probability e, using, respectively, private randomness, shared 
randomness between Alice and Bob, and shared entanglement between Alice and Bob (EPR pairs). 
The number of shared coin flips or shared EPR-pairs is unlimited and does not count towards the 
communication cost of the protocol. We use Ql(P), Q l t pub (P), Q\' ent {P) for the variety that allows 
quantum communication. 
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2.3 The random access code argument 

Here we will describe a slight extension of a quantum information theory argument due to Ashwin 
Nayak j^J that we will apply several times in our communication complexity lower bounds. We 
call this the "random access code argument". In the following, we assume familiarity with basic 
classical information theory ^U] and quantum information theory |24| . 

Lemma 1 ["Random Access Code Argument"] Let X = X\ . . . X n be a classical random variable 
of n uniformly distributed bits. Suppose for each instantiation X = x we have a quantum state M x 
of q qubits. Suppose also that for each i £ [n] of our choice we can apply a quantum measurement 
to Mx whose outcome is a (Xi,£i) -predictor for AQ. Then 

n 

5^A<(1-H(e0)<g- 

i=i 

Before giving the proof, notice the following special case: if we can predict each AQ with bias rji 
(i.e., we have a (1, 1/2 — ^-predictor), then the above bound becomes 

n 

£(1 ~#(l/2 -TH))<q. 
i=i 

Since 1 — H(l/2 — rji) = @(rjf), the left hand side is essentially the sum of squares of the rji. 

Proof: First, let Y be a classical random variable corresponding to a uniformly distributed bit. 
Let B be another random variable that is a (A, e)-predictor of Y . Using H(Y | B,B ^ ?) < H(e) 
and Pr[B ^ ?] > A, we can upper bound the entropy of Y given B: 

H(Y | B) = Pt[B = ?]-H(Y\B,B = ?) + Pt[B^?]-H(Y\B,B^?) 

< (1 - Pv[B / ?]) • 1 + Pv[B / ?] • H(e) < 1 - A(l - H{e)), 

and hence lower bound the mutual information between Y and B: 

I(Y : B) = H(Y) - H(Y \ B) > A(l - H{e)). 

Now let Bi be the outcome of the measurement corresponding to i applied to Mx- We have 

S(Xi : M x ) > I(Xi : B { ) > \(1 - ff( Ei )) 

by Holevo's theorem ^5] (the left hand side is equal to the Holevo x-quantity). 
Using |24l Theorem 11.8.5] we have 

S(X : M x ) = S(X) + S(M X ) - S(X, M x ) = S(M X ) - £ S(M X ) < S(M X ) < q. 
Abbreviating Xi-i—i = X% . . . Xi—i, a chain rule for mutual information gives 

n 

S(X :M X ) = Y, S ( X i : M x I 

Using strong subadditivity and the fact that S{Xi \ X\.^{) = S(Xi) we get 
S(Xi : M x | = S(Xi | - S(Xi \ M X X X ^ X ) > S(X t ) - S(X 4 | M x ) = S(X 4 : M x ). 

Combining our inequalities gives the desired lower bound on g. ■ 
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3 Bounded-error quantum state identification: Direct product 



3.1 Why this is delicate and non-trivial 

We briefly recall the 2-register state identification problem from the introduction. In the first 
register we are given a quantum state ax, with X a random bit, and the optimal probability with 
which we can get an e-predictor for X is a. In the second register we're given /3y, with Y a random 
bit, and the optimal probability with which we can get an e-predictor for Y is b. We now want to 
know the optimal probability p with which a joint measurement on both registers can obtain an 
e-predictor for XY. As mentioned in the introduction, intuition suggests that p < 0{ab). Before 
proceeding to prove a slightly weaker form of this statement (namely the special case where ao 
and ai are pure), we will pause to sketch two variants of the problem where the same intuition 
is provably false, even for pure states! This points to the subtleness of the state identification 
problem: seemingly small changes to the setup change everything. 

First, suppose that instead of an e-predictor for XY we want an e-predictor for the parity X®Y 
of the two bits. This might be slightly easier than getting both bits separately, but intuition still 
suggests that because both registers are independent, the best we can do is predict both registers 
separately and output their parity if both measurements gave an answer. So we expect p < 0(ab). 
However, this intuition is false. Consider the following counterexample, with 5 very small: 

l«o) = |A>> = |o> 

|a 1 ) = |/3 1 ) = V / T^|0) + ( 5|l) 

It is not hard to convince oneself 3 that for any fixed e < 1/2, the optimal a and b are 0(<5 2 ), 
so our intuition suggests p < O(ab) = 0(5 4 ) for the parity problem. However, if we apply the 
measurement with operator E$ that projects onto the state yg=p (<?|0Q) — 1 01) — 1 10) ) , E\ = 0, and 
E? = I — Eq, then on the parity-0 inputs ao ® A) and ai ® f3\ the measurement gives outcome 
with probability roughly 5 2 , while on the parity- 1 inputs it gives outcome with probability only 
about 5 e <C s5 2 . Thus, in this example p is of the same order as a and b instead of their product. 

In our second example, we return to the original setting where we want to obtain a predictor 
for XY (not their parity). We consider the case where in the left hand side of Eq. (^Q) from the 
introduction we replace e/2 with a slightly larger error parameter. Surprisingly, we show that in 
this case the bound p < 0(ab) is false. Choose e to be, say, 0.49, and replace e/2 in the left 
hand side of (^Q) with something slightly larger, say, 0.251. 4 To construct this example, we use 
the same states as in the previous example. For our choice of e, we still have a,b = @(S 2 ). Now 
consider the measurement where operator Eqq projects onto the state _J=(6|00}-§|01)-§|10)), 

Eq\ = Eiq = En = 0, and E? = I — Eqq. Then on the state ao <8> /?o we get outcome 00 with 
probability roughly 9<5 2 /8, while on each of the other three states this probability is roughly 5 2 /8. 
Conditioned on outputting an answer, our error probability is roughly (3/8)/(9/8 + 3/8) = 1/4, so 
we obtain a 0.251-predictor for XY . We see that again, contrary to our intuition, p is of the same 
order as a and b. 

Finally, to get a better feel for this problem and for why it is non-trivial, let us consider the 
classical case. This is the special case of the problem in which all states involved are classical 
probability distributions. In other words the density matrices ao, ai are diagonal in the same basis 
and similarly for /?o, 0i. 5 In this case, one can give a characterization of the optimal measurement. 
Let ao (resp., ai) correspond to some probability distribution on n elements with probabilities 

3 A rigorous proof can be obtained from the SDP formulation of this problem. 
4 With some effort, this example can be generalized to other values of e. 
5 This is related to optimal detector design, see e.g. |25| . Section 7.3. 
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pi, . . . ,p n (resp., qi, . . . , q n ). Assume without loss of generality that the n elements are sorted by 
non-increasing order of max{pi/(pi + qi),qi/(pi + qi)}- For any k > 1, consider the measurement 
that maps the outcome i for 1 < i < k to either if pi > q^ or 1 otherwise, and maps any outcome 
i > k to '?'. This means that for each i < k we output the guess (ao or ai) that is more likely, 
conditioned on i. Note that max{pj/ (p^ + qi/{pi + %)} represents the probability that our guess 
is correct, given i. Then, for any error parameter e, one can show that the best measurement is 
obtained by taking k as large as possible while still keeping the error probability of the resulting 
measurement below e. 6 

Now assume we have probability distributions ao , «i , A) > Al (equivalently, diagonal matrices) and 
we want to predict XY based on a sample from ax <8> Py (the tensor can be described classically as 
one sample from ax together with one independent sample from Py)- The optimal measurement 
in the two-register case can be obtained by a straightforward generalization of the measurement 
we have described in the single register case. As mentioned in the introduction, one might expect 
the optimal measurement to use the first register to predict X and the second register to predict Y 
separately, i.e., to be a tensor product measurement. It is perhaps somewhat surprising that this 
is not true in general, as can be seen using some simple examples. The intuitive reason for this is 
that if a sample from ax <S> Py is such that i gives a very strong indication of (say) ao, then 
we might be willing to predict the state ao <8> A) even if j gives only a weak indication of /3q. 

Nevertheless, the direct product theorem of Eq. (^Q) does hold in the classical case, even when 
we replace e/2 with e. One proof of this is based on a similar approach to the one we will take in 
the quantum case: first, formulate the problem in terms of linear programs (which are very similar 
to the semidefinite programs that arise in the quantum case) and then bound the dual solution of 
the joint system. Bounding the dual solution is the most demanding step technically, and amounts 
to solving some inequalities on real numbers. In the general quantum case, this step involves some 
(rather nasty) matrix inequalities that seem quite difficult to solve. In the special case that we 
consider below, these matrix inequalities turn out to have a sufficiently nice form to be analyzed. 

3.2 Proof of the direct product theorem 

In this section we prove our main results about the 2-register quantum state identification problem. 
We use the powerful technique of semidefinite programming duality. For details on semidefinite 
programming, see e.g. |201 I25|. We will actually prove two bounds. First, for the case where ao, 
ai are pure and Po, Px are unrestricted, our Theorem ^ implies 

D £/2 (a ® p , a ® Pi, a x ® p , a x ® Pi) < O(D £ (a , a x ) ■ D £ (p ,pi)). (2) 

Second, if we allow all of ao, ai, 0o, Pi to be mixed states then our Corollary ^ gives 

D £ / 2 (a ® Po,a <g> Pi, ai ® /3 ,«i ® Pi) < O(\\a - ai\\ tr ■ D £ (p ,Pi)). 

The second bound will follow from the first by purifying the mixed states ao and a\. 

Let us first characterize D £ (ao,ai). Recall that any measurement whose outcome is an (a, e)- 
predictor outputs the correct answer with probability at least 1 — e conditioned on outputting a 
guess (0 or 1, but not ?). Denote the three measurement operators by Eq, Ex, E?. Then we require 

Prfwrong guess] ±Tr[£ «i] + ^Trf^iao] 

e > Pr wrong guess guess = — = = , (3 

PqguessJ Tr [(Eq + E\)a\ 

6 To be precise, we should also allow non-integer k in the sense that when the outcome is \k~\ , one should output 
either or 1 (depending on whether > q[k]) with probability k — [kj and '?' otherwise. 
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where a = \{ocq + a i) is the average state. To our knowledge there is no simple expression for 
D e (ao, «i) in terms of cto and a\. However, one can easily express it as a solution to a semidefinite 
program (SDP). For fixed density matrices Qo, en and fixed e G [0,1/2), the optimal value a = 
D £ (oto,ai) is given by the following SDP: 

maximize Tt[(Eq + Ei)a] 

subject to 0^Eq,Ei, . . 

E + Ex ± I, U 
iTr[^ ai] + ^Tr[^ia ] < eTr[(E + E{)a]. 

The first two constraints state that the operators Eq,Ei together with a third operator E? = 
I — Eq — Ei form a valid quantum measurement. The last constraint bounds the conditional error 
probability, as in Eq. (J3J). An analogous SDP can be written for b = D £ (/3 , j3\). 

Similarly we can write the primal SDP that optimizes p = D £ (oeo ®Po, uq <8>/?i, ol\ <8> A)j cti (g> (3i): 



(5) 



maximize Tr^Ebo + Eqi + E w + E\\) a (g) 0\ 
subject to ^ E 00 , E 01 ,E 10 , E n , 

Eqo + Eoi + £io + En ^ I, 

±Tr [(£ Q i + Eva + En) «o ® Po + (^oo + E w + E u ) a ® (3i+ 
(£ o + E i + En) ai ® ft + (Eoo + #oi + #io) «i 0i] 
< eTr[(Eoo + En + E w + En) a ® /?]. 

Here a® j3 = \(ao <8> A) + «o <8> /3i + ai <8> /?o + Q<1 <S> A) is the average state. 

Theorem 1 Lei < e < ^ and ao,cn, /3o,(3i be density matrices, where ao,ai correspond to pure 
states |ao), |cn). Let b = D £ (f3o, fii) and p = D £ / 2 (oto <8> A), ao ® cn <8> /?o, «i <8> /?i). T/ten 

p< 16(1 - |<a |«i)| 2 ) • 6. 

Since ao and ai are pure, a = L> e (ao, «i) > Do(ao,a>i) > |(1 — |(ao|cn)| 2 ), where the last inequality 
follows by considering the projective measurement on \ao) and \c(q). Hence this theorem implies 
Eq. ©. 

Proof: The idea behind our proof is the following. As we observed before, both b and p are the 
solution of an SDP and so any feasible solution of the corresponding dual SDP yields an upper 
bound to b resp. p. We will show that a feasible solution with value df, > b for the dual for b can 
be used to construct a feasible solution with value 16(1 — |(ao|cn)| 2 ) • db for the dual for p. This 
value then upper bounds p. The dual SDP for b is strictly feasible in our case, which means that 
we can make db as close to 6 as we want. This implies the theorem. 

Let 5 := y/l — |(ao|«i}| 2 - Then we want to show p < 165 2 b. The dual SDP for b is 

minimize Tr[X;>] 

subject to Xb >z 0, Zb > 0, , , 

X b hU(l+ ez b )(3o + (1 - (1 - e)zb)f3i) =: X x , W 
X b h \ ((1 + ez b )l3i + (1 - (1 - e)zb)(3o) =: X 2 . 

This SDP is strictly feasible, for example, zj, = \-,Xb = 21 is a strictly feasible solution. Hence by 
strong duality its optimal value is exactly b. 
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The dual SDP for p is 



minimize Tr[X] 

subject to X y 0, z > 0, 

X h I ({(1 + |z)a + (1 - (1 - §»ai } ® A) + (1 " (1 " §K>K + ai) ft) =: X[, 
X h I ({(1 + |z)a + (1 - (1 - §)*)ai } ® A + (1 - (1 - f K>K + ai) ft] =: 
X >= J (1(1 + §2)ai + (1 - (1 - §)*)a } <8> ft + (1 - (1 - §)z)(a + «i) ® ft) =: X£, 
X >= I ({(1 + §2)ai + (1 - (1 - f»a } ® ft + (1 - (1 - §)2)(ao + «i) ® ft) =: 

(7) 

For what follows we need to define the positive part of a Hermitian matrix. Any Hermitian matrix 
A can be written uniquely as A = A + — A~ , where A + , A~ are positive semidefinite (^4 + , A~ >z 0) 
and have orthogonal support. Then define Pos(A) = A + . We need the following simple properties: 

Claim 1 1. If A<B then A < Pos(S). 

2. IfAtO then Pos(A ® B) = A ® Pos(S) . 

3. If A<B then Tr[Pos(A)] < Tr[Pos(B)]. 

Note that it is not true that if A ■< B then Pos(A) ■< Pos(5). 

Proof: The first part follows from B X Pos(-B). The second part can be seen by diagonalizing the 
matrices (note that the non-zero eigenvalues of Pos(B) are exactly the positive eigenvalues of B). 
The third part can be seen for instance by using majorization (see e.g. (Hj). If A ^ B, then the 
vector of eigenvalues of A is submajorized by the vector of eigenvalues of B (jSJ, Eq. (11.16), Ky Fan 
Maximum Principle). This means that if we order the eigenvalues of A (resp. B) as Ai > A2 > ■ ■ ■ 
(resp. p± > /i2 > • • •) then for all k > 1, Yli=i — Yli=i Pi- Together with the fact that the trace 
of Pos(^4) is the sum of the positive eigenvalues of A, the property follows. ■ 

We also need the following technical claim, which we will prove afterwards: 

Claim 2 Let < e < 1/2 and ctq, a±, po, p\ be density matrices, where po and p\ are 2-dimensional 
of rank 1 (i.e., pure states). Denote by p^ = I — p\ the rank 1 density matrix whose support is 
orthogonal to that of p\. Then for all z b > there exists z = z{e, z b ) > such that 

^ 2 p{®\ {(1 + ez b )a + (1 - (1 - e)z b )a 1 } 

- 4 ({ (1 + \ Z)PQ + (1 ~ (1 ~ £ 2 )z)pl } ® ff o + (1 - (1 - l»(po + Pi) ® ffi) • 

Fix a dual solution (X b , z b ) for ©. Our goal is to find a feasible solution (X, z) to Q such that 
Tr[X] < 165 2 Tr[X;,]. Since \ao) and \a>i) are pure states, we can assume without loss of generality 
that they are in a two dimensional space, and therefore we can apply Claim[2]with p = «0i Pi = a i> 
Co = ft and o"i = ft. Let 

Yi = 45 2 ai ® ~ {(1 + £2 6 )ft + (1 - (1 - e)z 6 )ft} = 45 2 af ® Xi. 

Claim|2lgives & z = z(e, z b ) such that Y\ >z X[ (see (J2J) for the definition of X[). Note that because 
aj y we can use Claim ^2: 

Pos(yi) = A5 2 ai ® Pos^ {(1 + ez b )p + (1 - (1 - e)z h )fc} = A5 2 a{ ® Pos(Xi). 
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Because a{ h 0, Tr[Pos(Yi)] = 4<5 2 Tr[Pos(Xi)]. Moreover, X 1 < X b by definition (see ®) and 
X b = Pos{X b ), hence Tr[Pos(Yi)] < 4£ 2 Tr[Pos(X 6 )] = 45 2 Tr[X b } (using Claim[Tj3). 

However, Pos(Yi) is not a solution of the dual SDP in (0) because it need not satisfy the 
last three inequalities. We construct three more matrices Y2, Y3 and I4 such that Yi y X- for 
the same z as before. For this we apply Claim [21 three more times (for Y2 = A5 2 a^ X2 with 
(p Q) pi,cro,<Ji) = (a ,a 1 ,f3 1 ,(3o), for Y 3 = A5 2 a^ ® X x with {p ,pi, a , <J\) = (ai, a , (3 , and for 
I4 = 4<5 2 o^J- <8>X 2 with (/9 , Pi, o" , cii) = (ai, «o> /$i> A)))- Because z depends only on z b and e, which 
are the same in all four applications, we obtain each time the same z. Now define X = J2 i=1 Pos(Yi). 
Clearly (X, z) is a feasible solution to the SDP (J7J) since X >z by definition and X >z Pos(Yj) >z X[ 
for i = 1 ... 4 (using Claim[Ul). But Trpf] = Y,t=i Tr[Pos(Yi)] < 16<5 2 Tr[X b ]. As Trpf] is an upper 
bound on p, and TrfA^] can be made arbitrarily close to b, this implies the theorem. ■ 

Proof of Claim |2J Because oq and a\ are positive semidefinite, it suffices to find a z > for 
which the equations 

U 2 P i\(l + ez b ) h\{(l + £ -z)po + (1 - (1 - |)z)pi} (8) 

and 

4*W±(1 - (1 - B )*) t 1(1 - (1 - £ -)z)( PQ + Pl ) (9) 

are true. 

Let \po), \pi) and \p^} be pure states whose density matrices are po, p\ and pj~. We choose their 
global phase such that \po) = \/l — S 2 \pi) + 5\pj- }. Then, in the basis given by |/0f ), Eqs. (jHJ 
and @ become 



-(l + §z)(l -£ 2 )- (l-(l-§)z) -ivT^Cl + fz) 

-<5v / T T7 5 2 (l + f^ " 85 2 (l + ez 6 ) -£ 2 (l + fz) 

z(l-e + 5 2 f) + 5 2 -2 -*vT=**(l + §*) . 
-<5v / T 3 5 2 (l + § z) <5 2 (7 + 8ez b - §z) 



and 



-(1 - (1 - §»(2 - J 2 ) -5v / T^(l - (1 - §)*) 



^vT^J 2 (l - (1 - §)*) 85 2 (1 - (1 - e)^) - 5 2 (1 - (1 - §)*) 

((l-|)z-l)(2-5 2 ) tfvT^Kl - §)* - 1) V n nn 
(tyT^((l - f )* - 1) 5 2 (7 - 8(1 - eH + (1 - §)z) ^ " 1 ) 



To show that a 2 x 2 Hermitian matrix is positive semidefinite it suffices to show that both its 
determinant and at least one of its diagonal entries are positive. We choose 

1-e 4 
z = 16 — z b + 



l-s/2 1-e 

Since z > 4, the upper diagonal entries of the matrices in Eqs. (|1U|) and (|11[) are positive. Moreover, 
if (5 = these matrices are trivially positive. If 5 > then we can cancel 5 2 > from both terms 
that appear in their determinants. Hence, for Eqs. (|l(Jj) and 1)11(1 to be true it suffices to show 

(z(l - e) - 2) (7 + 8ez & - |*) - (1 + |z) 2 > (12) 



10 



and 

(2 - <5 2 )((1 - £ -)z - 1)(7 - 8(1 - e)z b + (1 - |» - (1 - f)((l - -)z - l) a > 0. (13) 

To derive Eq. 1)12(1 we have replaced the term z(l — e + o~ 2 §) + S 2 — 2 by the smaller positive 
term z(l — e) — 2, which is legal because this equation is only true if 7 + 8ezb — §z > 0. Using 
(2 - o" 2 )/(l - 5 2 ) > 2 and (1 - §)z - 1 > 0, Eq. (T3J) is implied by 

2(7-8(1-e)z6 + (1-|) Z )>(1-|)z-1 

which is equivalent to 

1-e 15 



z > 16zb 



1 - - 1 — - 

1 2 1 2 



This inequality is true for our choice of z. It remains to show that our z satisfies Eq. ()12j) . 
Substituting for z we see that the quadratic term in z\, cancels and we obtain 

"-JT ± w) + MT L T-" e ) >0 - 

This linear inequality is satisfied (for Zf, > 0) because both its constant coefficient and the coefficient 
of Zb are positive for < e < \. ■ 

Using this result, we can as a corollary also prove a second, "asymmetric" direct product theorem 
when «o, «i and (3q, (3\ are all mixed states: 

Corollary 1 Let < e < 5 and ao, ai, A), A fre density matrices. Let a = \\oto — cti\\tr, b = 
D £ (/3q,Pi), and p = D £ / 2 (ao <8> 0o, ao &> ai (8) /3b, «l ® A). T/ien p < 32 a • 6. 

Proof: The idea is to work with purifications of ao and u\. By Uhlmann's theorem |24| p. 410] there 
exist purifications I do) and I ai) that preserve the fidelity, i.e., F(ao, a±) = F(|do), |5i}) = |(do|di)|. 
Using known properties of the fidelity |24[ Section 9.2.3], we have 

F(a.Q,ai) > 1 — ||qo — c*i\\tr = 1 — a. 

This implies 1 — |(do|di)| 2 < 2a. Let on = |dj)(dj|. Then, 

P = D £ / 2 (a <g> (3 ,a <g> 0i,ai <g> /3b, ai ® 0i) < D £ / 2 (a (g) f3 ,a g) /3i,di ® /3 ,di «> 0%) 

because one can obtain ao,«i by tracing out the purification degrees of freedom of do,di. Theo- 
rem H now gives p < 16(1 — |(do|di)| 2 ) • b < 32 a ■ b. ■ 

4 Shared randomness can be exponentially stronger than quan- 
tum communication 

4.1 The problem 

In this section we analyze the following communication problem Pi in the SMP model: 
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Alice's input: strings x, s £ {0, l} n , with Hamming weight |s| = re/2 
Bob's input: a string y E {0, l} n 

Goal: the referee should output (i,Xi,yi) for some i such that = 1 

We allow the referee some small constant error probability e < 1/8. In the next two subsections we 
show that this problem is easy if we have classical communication and shared randomness, and hard 
if we have quantum communication without shared randomness. More precisely, we will prove: 

Theorem 2 For the relational problem Pi defined above we have 

rIp^^) < O(logn) and Q\(Pi) > ^(re 1 / 3 ). 

4.2 Upper bound with classical communication and shared randomness 

Shared randomness gives the parties enough coordination to easily solve this problem. Alice and 
Bob just send (i,Xi,Si) and (i,yi), respectively, to the referee for log(l/e) public random i's. With 
probability 1 — e, Si = 1 for at least one of those i's and the referee outputs the corresponding 
(i,Xi,yi). With probability e he doesn't see an i for which Si = 1, in which case he outputs 
something random. Hence R\' pub (P) < 0(logrelog(l/e)). 

4.3 Lower bound for quantum communication with private randomness 

Consider some quantum protocol that solves our problem with error probability e < 1 /8, and where 
the messages that Alice and Bob send to the referee are at most q qubits long. Our goal is to show 
q > ^(re 1 / 3 ). 

First consider the mixed state message (3 y that Bob sends given input y. For i € [re], let 

y-Vi=o 

be the uniform mixture of all (3 y with yi = and define fin similarly. Let 6j = -D4 £ (/3io, fin)- Then 
by the random access code argument (Lemma ^) we have 

n 

Y,h{l-H{4£))<q. 

i=i 

By Markov's inequality, there is a set S of n/2 i's such that bi < 2q/n{\ — ff(4e)) < 0(q/n) for all 
i G S. We now fix Alice's input s to be the n-bit string with support corresponding to S. 

We now analyze Alice's message. Let a x be the mixed state she sends given input x and 
our fixed s. Define a^o as the uniform mixture of all a x with Xi = 0, similarly define an, and 
a i = \\a-io — o.n\\ tr . The optimal probability with which we can distinguish aio from an is ^ + y- 
The random access code argument gives 

n 
i=l 

Now we look at the protocol's behavior. Let X = X\ . . . X n and Y = Y\ . . . Y n be uniformly 
distributed random variables giving Alice's first and Bob's only input, and /, B\, B>2 be the random 
variables describing the referee's output. We call an index i £ [re] good, if the protocol is correct 
with high probability when it outputs (i, *, *): 
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i is good iff i £ S and Pr[£ a = X h B 2 = Y t \ I = i] > 1 - 2e. 

The index is called bad otherwise. Define p% = Pr[J = i] to be the probability that the referee 
outputs something of the form (i, *, *). Because the protocol is correct with probability at least 
1 — e, a Markov argument shows that the good indices must together have most of the probability: 

1 - e < J2 Pi + E 1 1 - 2e )^ = 1 - 2£ + 2e E 

good i bad i good i 

hence 

2 ^ E »• 

good i 

Notice that for each good i we can use the protocol to get a (pj, 2e)-predictor for XiYf just run the 
protocol and return '?' if the protocol's output is not of the form (i, *, *), and otherwise return the 
last two bits of the protocol's output. Therefore Corollary ^ implies pi < 0(aj6j). Also, hi < 0(q/n) 
for all good i so we can bound 



O 



n * — • / I n 

good i good i \ i=l 



ii 

n 

i=l 



7 3/2 



n 



1/2 



where we applied Cauchy-Schwarz in the fourth step. This implies q > f2(ra 1//3 ). 

Remark: The best no-shared-randomness protocol we know for Pi communicates 0(y/n) bits. 
The idea is to arrange the n-bit inputs in a \fri x y/n matrix. Alice picks a random row index 
in [\/n\, and then sends that index and the indexed row of x and of s to the referee. Bob picks 
a random column index in [y/n\, and then sends that index and the indexed column of y to the 
referee. The row and the column intersect in exactly one (uniformly random) point i S [n]. With 
probability 1/2, Sj = 1 and we are done. Repeating this a few times in parallel reduces the error 
probability to a small constant. A matching lower bound would follow from the general direct 
product theorem p < 0(ab), for the case of the 2-register identification problem where both sides 
are allowed to be mixed. 



5 Entanglement can be exponentially stronger than quantum com- 
munication with shared randomness 

5.1 The problem 

For n a power of 2, consider the following relational problem inspired by a one-way communi- 
cation problem due to Bar-Yossef et al. 

Alice's input: a perfect matching M C f^) an d a string x G {0, l}™/ 2 containing a 
bit x e for each edge e € M 
Bob's input: a string y G {0, l} n 

Goal: the referee should output (i,j,%nj),yi ® yj) for some edge € M 

Below we show that this problem is easy if we have classical communication and prior entanglement, 
and hard if we have quantum communication without entanglement: 

Theorem 3 For the relational problem P 2 defined above we have 

R\\> ent (P 2 ) < O(logra) and Ql' pub (P 2 ) > log n) 1/3 ). 
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5.2 Upper bound with classical communication and entanglement 

The following protocol solves the problem with success probability 1, using O(logn) classical bits 
of communication and logn EPR-pairs shared between Alice and Bob. It is a modification of an 
unpublished protocol due to Harry Buhrman [7j, which is in turn based on a one-way protocol 
from [3] . The starting state of Alice and Bob is 

4= E i*>k>- 

ie{o,i} lo s" 

Bob adds his bits as phases: 

Alice measures with the n/2 projectors Eij = + \ j){j\ induced by the n/2 pairs E M. 
This gives her a random € M and the resulting joint state of Alice and Bob is 

(KX-i)^K> + b->(-i)^U>) . 

Now both players apply a Hadamard transform to each of the log n qubits of their part of the state, 
which becomes (ignoring normalization) 

((-l)»*+(*+*H + (_i)W+(fc+<)-ij \k)\£). 

k,£ 

Note that \k}\£) has non-zero amplitude iff yi + {k + £) ■ i = yj + (k + £) ■ j mod 2, equivalently 

(k + £)-(i + j)=y i ®y j . 

Alice and Bob both measure their part of the state in the computational basis, obtaining some k 
and £, respectively, satisfying the above equality. Alice sends i,j,k, and xuj\ to the referee, Bob 
sends £; a total of O(logn) bits of communication. The referee calculates © yj from k,£ and 
outputs (i,j,xuj\,yi®yj) as required. 

5.3 Lower bound for quantum communication without entanglement 

We make use of some ideas from the classical lower bound of Bar-Yossef et al. For k £ 
{0, . . . , n/2 — 1}, let Mfc denote the matching {(i, (i + k — 1 mod n/2) + n/2 + 1}^. For example, 
Mi = {(l,n/2 + 2), (2, n/2 + 3), (3, n/2 + 4),... , (n/2 - 1, n), (n/2, n/2 + 1)}. We will prove our 
lower bound for the special case where Alice's matching is one of the M^. Consider a quantum 
protocol where Alice and Bob share randomness but no entanglement, each communicates at most 
q qubits to the referee, and they solve problem P2 with error probability e < 1/16 for each input. 
Our goal is to show q > f2((n/ logn) 1 / 3 ). 

We consider the following input distribution. Let K be a uniformly random number between 
and n/2 — 1, Mk be Alice's first input, and X G {0, l} n / 2 and Y G {0, 1}™ be uniformly distributed 
random variables for Alice's second and Bob's only input. Since the protocol has error at most 
e for all inputs, we can (and will) fix a value for the shared randomness such that the resulting 
protocol has average error at most e under the above input distribution. 

Let ockx be Alice's message on input Mk,x. For edge e = £ Mfc, define ctkeo as the uniform 
mixture of all a^x with x e = 0, similarly define a^ei, and a^ e = \\otkeQ ~ a kei\\t r - The optimal 
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probability with which we can distinguish akeo from akei is 1/2 + afc e /2. Hence for every k, the 
random access code argument (Lemma HJ gives 

eSM fe 

Let /3 y be Bob's message on input y. For any e = (not necessarily part of any matching), 

define f3 e o as the uniform mixture over all f3 y with m © yj = and similarly define f3 e i- Let 
6 e = Z?8e(/3e0) Pel)- We now prove two claims upper bounding sums of these b e . 

Claim 3 For any forest (i.e., acyclic graph) F on [n] we have b e < 0(q). 

Proof: Denote by \F\ the number of edges in F. For every e = € F we can obtain a (b e , 8e)- 
predictor for the bit © Kf given the g-qubit state /3y. Intuitively, since F is a forest, these \F\ 
bits are independent and therefore represent \F\ bits of information. To make this formal, define 
for each w S {0, 1}\ F \ the set 

T w = {y e {0, l} n | Ve = (i, j) eF, Vi (B Vj = w e }. 

Since F is a forest, {T w } we r n\F\ is a partition of {0, l} n into 2l F l sets of size 2 n ~\ F \ each. 

For any bit string w £ {0, 1}'^' we define £ w as the uniform mixture of (3 y over all y G T^. For 
each e £ F, define £ e o as the uniform mixture of £ w over all w with io e = and similarly define £ e i. 
Then, it is easy to see that £ e o = (3 e o and £ el = j3 e \. Hence, i^se^eOj Cei) = and by applying the 
random access code argument to the encoding of w as the g-qubit state we get 

Y,b e (l-H(8e))<q. 



n/2-1 

Claim 4 E E 6 e < 0(g 2 logn). 

fc=0 eeAf fc 

Proof: By construction all our M^'s are disjoint, hence the set M = U^Mfc contains each edge in 
the above sum exactly once. Making some bijection between edges in M and numbers I G [|Af|], 
we order the b e in non-increasing order as 

bi > b 2 > ■ ■ ■ > b\ M \. 

Now consider the graph consisting of the first I edges in this ordering. This graph must contain 
at least y/2£ non-isolated vertices, since v vertices give only — v 2 /2 distinct edges. Let F be 
a forest consisting of a spanning tree for each connected component of this graph. This F has at 
least V21/2 = \/£/2 edges, and for each of those edges e we have b e > be. Now we can use Claim EJ 




■h<Y,be<0{q). 



Hence for all I < \M\ we have 

be < 0(q/V£). 
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Summing over all £ gives 

\M\ n 2 /4 

E 6 e = E & ^E < 0(q 2 logn) 



eeM £=1 



Since the protocol has average error at most e, by Markov's inequality there is a set Ai of 
at least n/4 of our matchings such that the protocol has error at most 2e for that Mj. and 
uniformly random X and Y. Since Ai contains at least n/4 elements, Claim 0] implies there is a 
matching G Ai such that 



eSM fc 

We now fix this matching on Alice's side. Let I , J, B\, B 2 be the random variables giving the 
referee's output. Suppose we run the protocol with and uniformly random x and y as input. 
We call an edge good, if the protocol is correct with high probability when it outputs (i,j, *, *): 

e = (i, j) is good iff e G M k and Pr[#i = X e , B 2 = Yi Yj \ I = i, J = j] > I - 4e. 

The edge is called bad otherwise. Let p e = Pr[I = i, J = j] be the probability that the protocol 
outputs edge e. Since £ Ai, the success probability (averaged over x and y) is at least 1 — 2e, 
so by a Markov argument, the good edges must have most of the probability: 

1 - 2e < ^ Pe + E P^ 1 - 4e) = 1 - 4e + 4e ^ p e , 

good e bad e good e 

hence 

2 ^ E ^ 

good e 

For every good edge e, we can construct a (p e , 4e)-predictor for (X e , YiQYj). Hence, by Corollary^ 
Pe < 0(afc e 6 e ). Using Cauchy-Schwarz: 



bl\ < O 



' q 3 logn 



\ < ^Pe< E °(^) ^ J E 4e ■ E 
good e good e \ y good e good ( 

This implies the promised lower bound q > 0((n/ log n) 1 / 3 ). 

Remark: Our bound is tight up to logn factors. To see this, we briefly sketch a protocol which 
uses 0(n 1 / 3 logn) qubits of communication: Alice and Bob use their shared randomness to fix a 
subset S C [n] of size n 2 / 3 . With high probability the number of edges from M contained in S x S 
is roughly n 1//3 . For each of the edges 6 M D S x S, Alice sends {i,j,xuj\) to the referee, 

which is 0(n 1//3 logn) bits of communication. Bob prepares n 1 / 3 copies of the state 

^rE(-W) ( 14 ) 



and sends them to the referee. This gives a total of 0(n 1 / 3 logn) qubits of communication. On 
each of the copies, the referee measures with the projectors Eij = + \ j){j\ induced by the 
edges in S that Alice has sent, completed by E gar b a ge = I — Yl-^ij- Given the state in Eq. (|14|). 
the probability to not measure "garbage" is roughly n -1 / 3 . This means that with some constant 
probability the referee will measure one of the edges on one of the states Bob sent. This state 
then collapses to ^(( — + ( — an d a measurement in the basis \i) ± \j) gives y« © yj. 
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6 Conclusion and future work 



We studied the bounded-error quantum state identification problem and proved a direct prod- 
uct theorem for two independent instances of this problem (one involving pure states) using SDP 
duality. We applied our direct product theorem to obtain two exponential separations in the 
simultaneous message passing model of communication complexity. These two separations nicely 
complement each other: the first shows that shared randomness is much more powerful than private 
randomness, the second shows that prior entanglement is much more powerful than shared random- 
ness. Moreover, both separations are shown in the strongest possible sense: the stronger model is 
restricted to classical communication while the weaker model is allowed quantum communication. 

We identify some interesting problems left open by our work. First, for the bounded-error 
quantum state identification problem, prove the direct product theorem p < O(ab) in the general 
case where both sides have mixed states instead of one side pure and one side mixed. That result 
would lift, for instance, our quantum communication lower bound for the problem P\ to the optimal 
il(y/ri). Second, show similar communication complexity separations for decision problems (Boolean 
functions, possibly with a promise on the input) instead of for relational problems. Finally, we hope 
our direct product theorem will be useful for other applications as well. 
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